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CREATING MULTI-PAGE DOCUMENTS USING TIFF FILES 

Reference to Provisional Application 

The present application claims priority from US Provisional Application 
5 Serial No. 60/168,293, filed December 1, 1999. 

Field of the Invention 

The present invention is directed to a method for representing a multi- 
page document using a hierarchically-organized set of TIFF files. 

10 

Background of the Invention 

"TIFF-FX" is a proposed standard for the rendering and retention of image 
data. It is useful for transmission of facsimile-format documents over the 
Internet, and encompasses other standards such as JPEG, JBIG, and color fax 
15 standards. One aspect of TIFF-FX is that there is a special problem with 
rendering multi-page documents, and/or page images having multiple 
components (such as combinations of text, contone images, and line art) in a 
coherent format. 

In TIFF-FX, different types of image components (text, line art, contone) 
20 can be compressed in various ways, such as JBIG, JPEG, or fax formats. The 
different compression arrangements or schemes are called "profiles." Examples 
of profiles are: 

S= b/w, simple compression algorithm 
F= b/w, richer compression algorithm 
25 J= b/w, JBIG compression 
C= color JPEG compression 
L= color JBIG compression 



M= MRC = "mixed raster content" = in each page, different components are 
compressed in different ways. Different components of a page image are 
organized as "mask," "upper," and "lower," which are ultimately combined to 
create a single, multi-component page image. Typically, the "mask" is text, 
compressed in binary, JBIG, or the fax compressions Modified Huffman, 
Modified Read, or Modified Modified Read. The "lower" portion is typically 
contone images compressed in JPEG. The "upper" portion is typically line art 
compressed in GZIP. 

The present invention is directed to a system for organizing image data in 
a heterogeneous form, such as including both color and monochrome images, or 
images compressed according to different schemes, so that a TIFF-FX writer can 
automatically organize the data to create a single multi-page document. 

Description of the Prior Art 

U.S. Patent 5,706,457 discloses a system for acquiring and archiving 
images derived from multiple sources. An operator of the system can perform 
only a predetermined set of functions corresponding to graphical icons. Each of 
the icons launches a set of macro functions that format the image data into a 
predetermined format. 

U.S. Patent 5,724,579 discloses a system for producing "subordinate 
images" extracted from our original image data. The subordinate image data 
can be images directed to a portion of the original data, or subset of the original 
data making a thumbnail of the original data. A first subordinate image is 
extracted from original image data, and a second subordinate image is in turn 
extracted from the first subordinate image data. The main image data and the 
first and second subordinate image data are stored in the same file. 

U.S. Patent 6,052,198 discloses a system for organizing files associated 
with a single job ticket, such as in a digital printing context. The job ticket 
includes information on print files included in a print job, print file location 
information indicating a location of print files in a storage device, and information 
indicating a location of a rasterized of version of a print file in the storage device. 



When the job ticket is submitted to a printing apparatus, a rasterized version of 
the data is submitted instead of the original print file if the rasterized version was 
modified after the print file was modified. 



5 Summary of the Invention 

According to one aspect of the present invention, there is provided a 
method of organizing image data to create a multi-page document, comprising 
the steps of naming each of a set of files, each file representing either a page 
image or an image component of a page image, according to a naming 
10 convention, organizing the files into a hierarchical directory structure, and 
applying a writer application which recognizes the files by the naming convention 
to create a single, multi-page document. 

Brief Description of the Drawing 

15 Figure 1 describes an input file organization according to an embodiment 

of the present invention. 



Detailed Description of the Invention 

According to the present invention, a naming convention and directory 
20 structure is used to identify individual page images and/or page image 
components within a multi-page document, so that a multi-page, multi- 
component "source file" can be created. The basic approach to converting many 
single page TIFF files into a single TIFF-FX file is to: (1) organize the original 
TIFF files into a specified architecture (by a combination of naming convention 
25 and directory structure); and (2) execute a known TIFF-FX writer application to 
convert the set of TIFF files into the TIFF-FX file. 

The input data to the writer must be in a particular hierarchy on a disk to 
be properly handled. Figure 1 describes the input file organization. It can be 
seen in the Figure that a TIFF-FX writer can recognize a single page input file, a 
30 directory of input files (for a multi-page document of simple pages), or as a 
directory of directories (for a multi-page document wherein some or all pages 



3 



have multiple components, as described above). Quality Logic (formerly Genoa 
Systems) currently sells a product "TIFF-FXpert Test System" used to evaluate 
TIFF-FX files: this product can be used as a writer within the context of the 
present invention. 

5 According to one embodiment of the present invention, there are three 

modes to the TIFF-FX writer. If the "source argument" (the name of the file 
desired to be considered a single document) is a simple file name, then a single 
page TIFF-FX file will be generated. In such a case, any profile may be 
requested except the MRC profile, M. if the source argument is a directory of 

10 files, then a multi-page TIFF-FX document is generated. According to the 
convention of one embodiment, each file in the directory must have a file name 
"PageN" where N is a page number starting with 1. Source files not obeying this 
convention are ignored. Once again, in this case any profile may be requested 
except the MRC profile, M. 

15 To support the MRC profile, M, the source argument, may represent a 

directory of page directories. Each file must contain at least three files, which, in 
one convention, are named Mask, Lower, and Upper, corresponding to the roles 
described above in MRC profile layers. To test profile M, input data follow this 
format. All other files will be ignored. 

20 According to one alternate embodiment, there may also be included, in 

the hierarchy, an "info" or "directive" file, which contains data relating to at least 
some of the other files within the same directory. This "info" file could include 
instructions that, for instance, the text in the mask within the same directory 
should be compressed in a specific way, such as in G3 format, or the contone 

25 data must be compressed in JPEG; also, the info file can specify a particular 
quality level for the compression algorithm. 

In a preferred embodiment, all source files should conform to TIFF6 
(baseline + standard extensions) specifications. 

With all source images in the format described above, the TIFF-FX writer 

30 can proceed to read original data in various formats and emit the hierarchically- 
organized TIFF-FX files. 
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Although a TIFF-FX implementation is shown here, the basic principle can 
be applied to the creation of other multi-page document formats. 

The present invention simplifies the testing and debugging of TIFF-FX 
images. TIFF-FX files can potentially represent many pages of image data, each 
5 page being quite complex (i.e., profile M). Real applications may require 
significant additional processing (e.g., segmentation of an image into 
Foreground, Background, and Mask layers). This representation allows 
separation of the development of segmentation algorithms from the development 
of the TIFF-FX writers/readers, and defines a common means by which 
10 developers can interchange test data. The present invention can be used to 
convert existing repositories of document data into TIFF-FX files. Scripts can be 
constructed that would take existing repositories and convert them into the 
appropriate hierarchy, then a TIFF-FX writer would generate the TIFF-FX files. 
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CLAIMS: 



1. A method of organizing image data to create a multi-page 
document, comprising the steps of: 

naming each file of a set of files, each file representing either a page 
image or an image component of a page image, according to a naming 
convention; and 

organizing said files into a hierarchical arrangement; and 
applying a writer application which recognizes the files by the naming 
convention to write a single file, multi-page document. 

2. The method of claim 1, the naming step including providing a file 
within the set of files with a simple file name, the simple file name causing the 
writer application to generate a single page file. 

3. The method of claim 1, the naming step including naming a file within 
the set of files as a directory of source files, the directory causing the writer 
application to generate a multi-page document. 

4. The method of claim 1, wherein each file in the directory is named 
according to a naming convention which identifies each file within the directory 
as relating to a page. 
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5. The method of claim 1, the naming step including naming a file within 
the set of files as a directory of page directories, the directory causing the writer 
application to generate a multi-page document with each page directory 
corresponding to a page. 

6. The method of claim 5, wherein each page directory includes a 
plurality of files, each file corresponding to a page image component for a single 
page image. 

7. The method of claim 6, wherein the page image components are MRC 
profile layers in TIFF-FX. 

8. The method of claim 1, further comprising the step of including, 
among the set of files, a directive file, the directive file being readable by the 
writer application and instructing the writer application to process another file in 
the set of files in a predetermined manner. 

9. The method of claim 8, wherein the predetermined manner relates to 
selecting a compression technique. 

10. The method of claim 8, wherein the predetermined manner relates to 
selecting a quality level for a compression technique. 

11. The method of claim 1, wherein a first file in the set of files is 
compressed according to a first compression scheme, and a second file in the 
set of files is compressed according to a second compression scheme. 

12. The method of claim 1, wherein a first file in the set of files relates to 
a monochrome image, and a second file in the set of files relates to a color 
image. 
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ABSTRACT 

A system for organizing a heterogeneous set of page image data, such as 
including both simple-compressed and JPEG compressed, and/or color and 
monochrome data, so that the data can be organized as a single, multi-page, 
TIFF-FX document. Different page images and image components are named 
according to a simple convention. When a TIFF-FX writer application is applied 
to the data, the writer application responds to the names of the files and the file 
hierarchy to write a single, multi-page TIFF-FX document. 



8 



7/7 



05 

O 

a> 

Q 

o 

© 

U 



X 

fX 

i 

IX, 
IX, 



.x a 



o 



0> 

o 

PL, 



en 
<3 



o 



5-h 

Oh 
Ph 



o3 



s ^ 

« I — I 

i O 

go 



Iff 



o 



a 



© 



IX 
IX 



.x S 
x^ 2 



u 

GO 

o 

5-H 

CX 



i 

GO 





rsi 


CO 


















03 


IX, 


Oh 


Oh 



ft 

a 

'3d 



X 

1 

PX 
(X 



03 a 
sx <3 

• ^ o 
00 -cJ 



P-T 
o 

J3 

o 

PLh 



1 

z 

o 
GO 



illl 



PATENT APPLICATION 

Attorney Docket No. d/99807 

DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the subject matter which is claimed and for which a patent is 
sought on the invention entitled: METHOD OF CREATING MULTI-PAGE DOCUMENTS FROM TIFF 
IMAGES 

the specification and claims of which 

[§] are attached hereto OR □ was filed on as U.S. Application No. 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including 
the claims. 

I acknowledge the duty to disclose information which is material to the patentability as defined in Title 37, 
Code of Federal Regulations, §1 .56. 

I hereby claim priority benefits under Title 35, United States Code, §119 of any foreign or U.S. Provisional 
application(s) for patent listed below, and have also identified below any foreign application(s) or Provisional 
application(s) for patent having a filing date before that of the application on which priority is claimed: 

Prior Foreign or U.S. Provisional Application(s) 

60/168,293 USA 01/12/1999 

(Number) (Country) (Day/Month/Year Filed) 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following registered practitioners 
to prosecute this application and transact all business in the Patent and Trademark Office connected 
therewith. 



John E. Beck 
Ronald F. Chapuran 
Mark Costello 
Richard B. Domingo 



Reg. No. 22,833; 
Reg. No. 26,402; 
Reg. No. 31,342; 
Reg. No. 36,784; 



ADDRESS ALL CORRESPONDENCE TO: 
John E. Beck 
Xerox Corporation 
Xerox Square 20A 

Rochester, New York 14644 



Henry Fleischer Reg. No. 25,582 

Eugene O. Palazzo Reg. No. 20,881 

Denis A. Robitaille Reg. No. 34,098 

Robert Hutter Reg. No. 32,418 

DIRECT TELEPHONE CALLS TO: 
(name and telephone number) 
Robert Hutter 
716-423-3811 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, 
under §1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issuing thereon. 
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DECLARATION AND POWER OF ATTORNEY, continued 

Name of sole or first inventor: Dennis L. Venable * 
Inventor's Signature: 

Residence: 4353 Dormedy Hill Ffoad, MarionfNew York 1 4505 

Citizenship: USA Post Office Address: 

(Same as above) 
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